Applying Back Propagation Algorithm for classification of fragile genome sequence
نویسندگان
چکیده
Most frequently occurring recurrent chromosomal translocation allied with all subtype of leukemia are available in Mitel Mann Data base. We have retrieved about 55 such genome sequence from TIC dB data base with 100% similarity score and got noncoding sequence of chromosome 9 and 22 as positive example of fragile site. Another 55 housekeeping genome sequence is taken for classification purpose. For content based analysis we have extracted 20 features of frequency density of mono nucleotide and dinucleotide. The network is designed by determining hyper parameters like number of hidden layer, hidden neurons and input features. First we took 20 input features and there after 16 for reducing number of free parameters (i.e. weight space). Network is also pruned for succeeding experiments. The training strategy was also exhaustively explored, based on literature study and trial and error heuristic methods to achieve more and more accuracy. Regularization is also employed by cross validation and early stopping. We have achieved 95% accuracy for training data and 70% to test data in first experiment. To avoid this over fitting at last we could achieve 93% over all accuracy and outlier detection, too. We could be able to show that dinucleotide frequency density is important statistical feature for classifying genome sequence. This classifier can show the probability of fragility to occur in genome sequence at very early stage so as to deal with the diesis at prognosis phase.
منابع مشابه
Classification of ECG signals using Hermite functions and MLP neural networks
Classification of heart arrhythmia is an important step in developing devices for monitoring the health of individuals. This paper proposes a three module system for classification of electrocardiogram (ECG) beats. These modules are: denoising module, feature extraction module and a classification module. In the first module the stationary wavelet transform (SWF) is used for noise reduction of ...
متن کاملUsing Artificial Neural Network Algorithm to Predict Tensile Properties of Cotton-Covered Nylon Core Yarns
Artificial Neural Networks are information processing systems. Over the past several years, these algorithms have received much attention for their applications in pattern completing, pattern matching and classification and also for their use as a tool in various areas of problem solving. In this work, an Artificial Neural Network model is presented for predicting the tensile 
properties of ...
متن کاملUsing Artificial Neural Network Algorithm to Predict Tensile Properties of Cotton-Covered Nylon Core Yarns
Artificial Neural Networks are information processing systems. Over the past several years, these algorithms have received much attention for their applications in pattern completing, pattern matching and classification and also for their use as a tool in various areas of problem solving. In this work, an Artificial Neural Network model is presented for predicting the tensile properties of co...
متن کاملOn the use of back propagation and radial basis function neural networks in surface roughness prediction
Various artificial neural networks types are examined and compared for the prediction of surface roughness in manufacturing technology. The aim of the study is to evaluate different kinds of neural networks and observe their performance and applicability on the same problem. More specifically, feed-forward artificial neural networks are trained with three different back propagation algorithms, ...
متن کاملClassification of encrypted traffic for applications based on statistical features
Traffic classification plays an important role in many aspects of network management such as identifying type of the transferred data, detection of malware applications, applying policies to restrict network accesses and so on. Basic methods in this field were using some obvious traffic features like port number and protocol type to classify the traffic type. However, recent changes in applicat...
متن کامل